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THE LARGE SIEVE, MONODROMY AND ZETA FUNCTIONS OF 

CURVES 

E. KOWALSKI 


Abstract. We prove a large sieve statement for the average distribution of Frobenius conju- 
gacy classes in arithmetic monodromy groups over finite fields. As a first application we prove 
a stronger version of a result of Chavdarov on the “generic” irreducibility of the numerator of 
the zeta functions in a family of curves with large monodromy. 


1. Introduction 

In [0], N. Chavdarov proves that, in an algebraic family C —> U of smooth projective curves 
of genus g over a finite field Eg, if the monodromy groups mod i of the family are “as large as 
possible” for almost all then the numerators det(l — TFr | Q^)) of the zeta functions 

of the curves Cu of the family are “almost all” irreducible, and in fact have splitting field as 
large as compatible with the existence of the symplectic intersection pairing. 

Chavdarov’s method, as sketched in the introduction to is analogue in principle to the 
method used by van der Waerden to show that “most” polynomials of given degree d with 
integer coefficients have splitting field as large as possible. This latter result was reproved in 
a simpler way and stronger form by Gallagher using the large sieve inequalities as a new 
analytic tool. This suggests trying to apply similar ideas to Chavdarov’s problem. In this paper, 
we show that this is indeed possible, to some extent. This proof also yields a much stronger 
result than [^1 in many cases; see Theorem 16.1 l and Theorem lOl for the exact statements. The 
proof uses some of the same tools, together with deep ideas of analytic number theory and some 
new uniform estimates for Uadic Betti numbers which may be of independent interest. 

The plan of this paper is the following; in the next two sections, we introduce the data 
involved and then state our main bilinear form estimate from which we derive a “large sieve” 
statement, essentially in the same way as the classical case. In the next two sections we prove 
the bilinear form estimate. First, Section 0 establishes some useful estimates for sums of Uadic 
Betti numbers of “Artin-like” sheaves in various circumstances (restricted unless the base space 
is a curve). The proofs rely on the methods used by Katz in [K2j and IKH ” the difference being 
the parameters for which uniformity is required. Then in Sectional we conclude the proof. 

In the final sections we apply the sieve statement to prove our form of Chavdarov’s Theorem. 
The statements at least are accessible (and of some interest) without knowledge of the techniques 
of etale cohomology required for the proof. Families of a fixed genus can be treated pretty 
quickly, but we expand some effort to obtain in some cases a uniform result that can give 
information about curves of genus g over Eg when q and g are simultaneously large (although 
g must be much smaller than q). We also draw a few easy consequences fProposition 16.^1 and 
Proposition lb.5j) . as illustrations of results which seem out of reach of Chavdarov’s method, but 
are is not meant as really important results in themselves.^ 

One can hope that other applications of this method will arise, by analogy with the situation 
in analytic number theory, where the ideas surrounding the large sieve have been extremely 
successful since the original discovery by Linnik. 
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^ For analytic number theorists, let us mention that, incidentally, we also get a version of Gallagher’s esti¬ 
mate m uniform in terms of the degree, see the Hnal remark in Section Q 
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The related paper |Ko2j applies Chavdarov’s Theorem and some extra ingredients to the 
study of the characterization of abelian varieties over finite fields or number fields by their 
torsion fields. Also in we use the Betti number estimates and a uniform Chebotarev 

density theorem to study the density of quadratic twists of elliptic curves over function fields 
over finite fields with rank ^ 2 . 

Acknowledgments. N. Katz found a serious mistake in the first version of Section|ll thanks 
for pointing it out and for other enlightening remarks. 

Background references. Since we are using two important themes in number theory which 
may not be equally known to the reader, we mention a few general references. For the large 
sieve, the reader may consult [Q, |Moj or |IK1 Ch. 7]. For the approach to exponential sums via 
f-adic methods and their applications (with which the author, for one, is not so well acquainted), 
we suggest jEHU Ch. 9], Ch. 2,3], [Ea Ch. 1-3], ina Sommes trig.], or [Kl § 11 . 11 ]. 

Notation. As usual, jXj denotes the cardinality of a set, &g is the symmetric group on g 
letters. By f g for x € X, or f = 0{g) for x € X, where X is an arbitrary set on which / 
is defined, we mean synonymously that there exists a constant C ^ 0 such that l/(x)j ^ Cg{x) 
for all X € X. The “implied constant” is any admissible value of C. It may depend on the set 
X which is always specified or clear in context. 

2. Preliminaries 

Our main tool is a general estimate for a bilinear form made up from representations of a 
system of lisse F^-sheaves on a variety over a finite field. 

The first basic data is therefore a base variety U/Fq, where as usual Fg denotes a finite field 
of characteristic p with q elements. We assume that U is smooth, affine, and geometrically 
connected of dimension d ^ 1. 

We denote by fj the geometric generic point of U and by U the variety U extended to Fg. 
We therefore have the arithmetic fundamental group 7 ri(f 7 , r/) and the geometric fundamental 
group 7ri{U,T]). Those fit in an exact sequence 

(2.1) 1 -, 7ri{U,rj) -. vri([/,r/) Gal(Fg/Fg) ~ Z -^ 1. 

For n ^ 1 and u £ U (Fg^), we denote by Fr^^g^ the geometric Frobenius automorphism at u 
in 7ri(17, ry), i.e., the image of the inverse of the canonical generator x of the Galois group 

of Fqn via the map 

Gal(Fgn/Fgn) ^ 7ri(f7,r/) 

induced from the inclusion SpecFgn —> U which “is” u. In the above exact sequence we have 
then 

d(Fr„,g") = -n. 

In most of our results, the base field (i.e., q) will be considered fixed, althought the results 
will be uniform in q so they can be applied to U x Fgn for any n ^ 1. So most of the time we 
just write Fr„ instead of Fr„^g for u £ t/(Fg). 

We also denote generically by Fr the global geometric Frobenius automorphism, acting for 
instance on Gadic cohomology groups. 

We now come to the sheaves on U that we consider. We assume given a set A of primes 7 ^ p, 
and for ^ £ h., a, lisse sheaf Xi of F^-vector spaces of (fixed) rank r ^ 1, where Fa is a finite 
field of characteristic I (the degree of which over F^ may depend on €). The basic example is 
when we have lisse sheaves Xi of ZA-modules and 

Xi = TijraxTi, 

where Z^ is the ring of integers in a finite extension of with residue field F^ and maximal 
ideal ttia. However, we do not assume that Ti is of this type (of course, it will be in most 
applications). 
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Equivalently (and this is the most convenient viewpoint in terms of a first understanding at 
least), “is” a representation 

Pi : 7ri(17,?7) ^ GL(r, Fa). 

From this description we can easily define the monodromy groups of J-i, or of pf the arithmetic 
monodromy group Gi C G'L(r, Fa) is the image of pi, and the geometric monodromy group 
is the image of the subgroup TTi{U^rj). Thus from (|2.1 II we derive a commutative diagram with 
exact rows and surjective downward arrows: 

1 -> Tii{U,rf) -> Tii{U,ri) —^ Z -> 1 

(2.2) I I ^ 

1-> Gl -. G, r,-. 1, 

where is a finite commutative (cyclic) group. 

In the case where the sheaves Ti arise by reduction of ZA-sheaves as described previously, 
one says that they form a compatible system if for every extension Yqn /Fg, every u £ U{Fqn) and 
every i € A, the reversed characteristic polynomial of Fr^^gn acting on IFi, i.e., the polynomial 

det(l - TFr^^gn | 

has coefficients in Q and is independent of i. 

For any i we will consider various sums involving irreducible (complex valued) linear repre¬ 
sentations of Gi- For reasons that will become clear during the proof of the main bilinear form 
estimate (a certain phenomenon of “imprimitivity”), we can not use all representations, but 
must ensure that those used are suitably orthogonal when restricted to the subgroup G^. 

For this we have the following lemma. 

Lemma 2.1. (1) Let n, tt' be irreducible linear representations of G^. Then vr and n' are 
equivalent when restricted to Gf if and only if there exists a character tp G Ti, the character 
group ofTi, such that 

TV = tt' ® [ip o m). 

(2) If TT and tt' are not equivalent restricted to G^, the representation tt (g) tt' restricted to G^ 
does not contain the trivial representation, where tt' is the contragredient of tt' . Otherwise it 
contains the trivial representation with multiplicity equal to |r^| where 

tf = {pj G t£ I TT ~ TT (g) (V' o m)}. 

Proof. We will identify characters Tp gT£ with characters of G^ by tpix) = 'ip{m{x)) for x G G^. 

For any representation r of G^, let j(r) denote the multiplicity of the trivial representation 
in the restriction of r to G^. This is given by 

= i^{'m{x))TvT{x) 

= '>P{x)Tit{x) 

(by orthogonality of characters of T^), which is the sum of the multiplicities of the characters 
Tp of G£ in r. (This interpretation being of course also available by a simple application of 
Frobenius reciprocity). 

Applying this to r = tt (g) it', it follows that 1 is contained in the restriction of r to G^ if 
and only if there exists a Tp such that ip is contained in vr (g) it'. However if that is the case, the 
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trivial representation is contained in vr 0 but as vr and tt'iIj are irreducible, this means that 
TT ~ tt' (8) V’ as representations of Gi. This shows the “only if” part of the lemma, and the other 
direction is trivial since ■0 restricts to the trivial character of G^. 

The first part of (2) is contained in the previous paragraph. The assertion about the multi¬ 
plicity is also clear: if tt' = vr^V’o is a twist of vr by a character tpQ, the multiplicity j(7r(8)vr') is the 
sum of multiplicities of the characters ■0 in each of which is equal to 1 if vr ~ tt (8) {'i/j'iIjq), 

and 0 otherwise, i.e, it is equal to 1 if '0'0o £ and 0 otherwise. So the total multiplicity is 
the number of elements in TJ. □ 

Remark 2.2. If tt ~ vr (8) 0, we must have 0(x) = 1 whenever Tr7r(x) 0. But if deg(7r) > 1, it 
is well-known that there are elements x £ Gi with Tr7r(x) = 0, and then the value of 0 is not 
determined. Of course, “in general”, we have T^ = 1, but (for instance), for any representation 
TT of degree 2 of a dihedral group n even (of order 2n), there is a character 0 with tt ~ 7r(8)0. 

Say that two representations Gi are geometrically equivalent if their restrictions to G^ are 
equivalent, or (by the lemma) if and only if they differ by a twist by a character of T^. We 
now assume chosen a set of representatives of the irreducible representations of Gi for this 
equivalence relation. Using these and characters of T^ one can parameterize all irreducible 
representations of Gi as follows: they are of the form vr (8> 0 where vr £ and 0 £ T^; the 
representation tt is unique, but 0 is only unique up to multiplication by an element of the group 
TJ defined in (2) of the previous lemma. 

This ambiguity requires us to control the size of those groups TJ. We will assume that for 
all £ £ A and tt £ II^, we have 

(2.3) |f^| 0 K 

for some fixed k 0 1. 

Here are useful cases when we can get such a bound. 

Lemma 2.3. (1) Assume that for all I we have G^ = SL{r,Fi). Then H2.3I1 holds with k = r. 

(2) Assume that r is even and that for all i we have Gj = Sp{r,Fi), the symplectic group 
for some non-degenerate alternating form (•,•) on F^, and that Gi is a subgroup of the group 
SSp{r,Fi) of symplectic similitudes, i.e., for g £ Gi we have {gv,gw) = m{g){v,w) for some 
m{g) £ F^, called the multiplicator of g. Then (12.311 holds with k = 2. 

Proof. (1) If TT is an irreducible representation of Gi and 0 £ T^, then 0 is trivial on the center 
Zi of Gi. For any x € Gi, we can write 

x” = (det x)y 

with y £ SL{r,Fi) = Hence det(x) £ F^ Ci Gi C Zi and therefore 0(det(x)) = 1, and 
0(x'’) = 0(det(x))0(y) = 1. So 0 is of order at most r, and since it is a character of a cyclic 
group, there are at most r such characters, giving with K = r. 

(2) The argument is similar except that we now have = m{x)y with y £ Sp{r,Fi) (since 
m{ax) = afx for scalar a), so 0(x)^ = 1. □ 

We will often simply write (when £ is deary specified, e.g. as a summation parameter occuring 
before) 

a{TT,£,...), a{TT,£,...) 

TT TTT^l 

for, respectively, a sum over all the irreducible representations tt £ H^ of G^ or for a sum over 
all those which are non-trivial on Gf. Similarly, a sum of the type 

TT "0 

means (unless otherwise specified) that vr £ H^ and 0 £ Ti/TJ-, in other words, this is a sum 
over all irreducible representations of Gi, parameterized as described previously. 
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We need various estimates involving sums of dimensions of the representations in II^. We 
will phrase them in terms of upper bounds for the “dimensions” of and of the set G\ of its 
conjugacy classes: let s and t be such that the inequalities 

(2.4) \G,\ ^ Cl£^ \G\\ ^ 

hold for all primes £ £ A, ci and C 2 being two given constants. Note that of course s = t = r'^ 
is always possible with ci = C 2 = 1 (and that in fact this does not in general significantly affect 
the applications). 

Lemma 2.4. (1) We have 

^ dimvr < (c2r+*)^/2^ 

and for all tt £ Hi we have 

dimTT ^ 

(2) //G^ = SL{r,Fi), the estimates (j2.4ll hold with 

Cl = 1, s = C 2 = 6”, t = r. 

(3) If r is even, Gf = Sp{r,Fi) and Gi C SSp{r,Fi), the estimates (12.41) hold with 

Cl = 1, s = I + ^ C 2 = 6”/^, t = r/2 + l. 

Proof. For a representation of a finite group G, the dimension is always ^ and the sum 

of the dimension is bounded by Cauchy’s inequality by 

^dimTT ^ 

TT 

so that (1) is a direct translation of H2.4I) . 

(2) is obvious, noticing that the number of conjugacy classes in Gi is at most 

iFiWGfl l)\\Gf\ l)(6£)”-i 

(by |LPl Lemma 1.4] for instance; the factor 6^ ^ takes into account the non-semisimple conju¬ 
gacy classes). 

(3) is similar using the formula for the cardinality of Sp{2g,Fi), and ILH Lemmas 1.3,1.6] 

for the conjugacy classes. □ 

Our last definition is also of crucial importance for the bilinear form estimate: 

Definition. We say that the family (iFi) is linearly disjoint if for all i and P in A, with I P, 

the product map 

7ri(I7,rj) ^ Gf X Gl 

is surjective. 

This is a fairly natural independence notion for the various monodromy groups. In many 
cases it will hold for group-theoretical reasons simply because the G^ are “large” groups and 
close to simple; this is related to Goursat’s lemma, and we quote here the version in 0 Pr. 
5.1], (specialized for 2 factors): 

Lemma 2.5. Let Gi and G 2 be finite groups such that every normal subgroup ofGi is contained 
in the center Gi, and such that G\jG\ and G 2 IG 2 are distinct, simple and non-abelian. Then 
no proper subgroup G C Gi x G 2 projects surjectively on both Gi and G 2 . 

This is typically applied with Gi = G|, G 2 = G^, and G the image of 'i:i{U,rj) —> Gi x G 2 
which does project surjectively on both factors. 

For instance, this shows: 
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Corollary 2.6. (1) Let r be even and let {J-f) he a family of sheaves as above such that = 
Sp{r,Fi) for all I in A, with i ^ 5 if r = 2 and i ^ 3 if r = A. Then the family is linearly 
disjoint. 

(2) Let (iF^) be a family of sheaves as above such that = iS'L(r, F^) for all £ in A, with 
i ^ 5 if r = 2. Then the family is linearly disjoint. 

This follows because it is very classical that the center of Sp{r,F() (resp. 5L(r, F^)) is ±1 
(and is the only non-trivial normal subgroup) and Sp{r,Y()/{±.1} (resp. 5L(r, F^)/{±1}) is a 
simple non-abelian group in the cases described (see e.g. 0 Th. 5.1, Th. 4.9]). On the other 
hand, notice the lemma can not be applied for orthogonal groups. (For instance, if I, £' are odd, 
the proper subgroup 

{(x,y) G 0 (r,F£) x 0 (r,F£/) | det(x) = det(y)}, 

where the equality makes sense because the determinants are ± 1 , clearly projects surjectively 
onto both factors). 


3. Bilinear form estimates and large sieve for algebraic families 


We now state the bilinear form estimate which is our main tool. 


Theorem 3.1. Let U be a variety and {£Fi) a family of sheaves as above, with given sets 
of irreducible representations which are representatives for geometric equivalence. Assume that 
the family is linearly disjoint, that it satisfies and moreover that U and (tFi) satisfy one 

of the following conditions: 

(i) U is a smooth affine curve and {£Fi) arises from a compatible system of integral £-adic 
sheaves; 

(ii) For all i & A, the order of is prime to p. 

Then there exists constants C ^ 0 and A ^ 0 such that we have 
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(3.1) EE 

if^L ueU(Fg) 


Y] a(u)Tr( 7 ro p^)(Fr^ 


^ (kq'^ AGq'^ ^^^lY \a{uY, 

U&U{Fq) 


for any L ^ 1 and any complex coefficients a{u). 

In case (i), we can take A = 1 + s + t/2, and the constant G depends only on U, the 
“geometric" compatible system {iFi) on U and the constants ci and C 2 . In case (ii) we can take 
A = 1 + 5s/2 + t/2, and the constant G depends only on U, ci and C 2 . 

In particular the estimate can be applied uniformly for U 0 Fgn for any n ^ 1. 

Note that the left-hand side of (EU) is in fact independent of the choice of representative sets 

n,. 


Remark 3.2. Here are a few remarks, most of which are of a general nature and are standard 
observations for any type of bilinear form estimate. 

(1) The estimate (|3.1I) is most interesting when L is small enough that ^ so that the 

sum of the two terms and is still of size g^, which is roughly the number of terms 

in the inner sum over u G U{Fq) by the Lang-Weil estimate |[/(Fq)| = q'^ + 0(g^“^/^). 

(2) The restriction to vr 7 ^ 1 in the summation in (13.111 is essential: the additional contribution 
of the trivial representations would give the quadratic form 

e^L u£U{Fq) 

which by Cauchy’s inequality has norm |?7(Fq)|L x q^^L, which exceeds {nq'^ + in the 

most interesting ranges where L is small as in the previous remark. 

(3) In (13.11) . the trivial bound has {nq^ + q'^ replaced by 

e^L TT^i 
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(i.e., the ratio is bounded from above and below; we assume that t is chosen optimally). On 
the other hand, from general principles (see e.g. |IK1 §7]), the best possible result is essentially 
with 


+ L^+^ 


and nowadays it is usually estimates of similar strength which are called large sieve inequalities, 
even when no connection with sieve theory exists. 

Thus from the point of view of the study of bilinear forms in modern analytic number theory, 
the estimate (EU) is much too weak to deserve the name of large sieve. However, we are using 
it mainly to derive a sieve-type result which corresponds to Linnik’s original description of a 
“large” sieve, so we use the word in this sense. 

(4) Using geometric considerations one can get similar results for more general U, for instance 
by dealing with irreducible components one by one. Or if U is the disjoint union of Ui and U 2 , 
with Ui a dense open subscheme which is smooth affine and geometrically connected, and U 2 
closed of codimension ^ 1, the sum on the left of (EU) is at most twice the sum 


I ^ a(u) Tr(7r o p^)(Frt, 
neC/i(FJ 




Q;(n) Tr(7r o p^)(Fr^ 

U&2{¥g) 

and EU) applies to the first sum while the second has a contribution <C mq'^ by Re¬ 

mark (3), where m is the number of irreducible components of 7/2 <8> Fg. Another case would 
be to have a map t/ —> U with “most” fibers being smooth, affine, connected curves on which 
the induced sheaves have the same monodromy as on U, and for which the constant C in EH 
happens to be uniformly bounded for all such fibers. 

(5) Formula below gives a more explicit bound which may be better in some cases where 
more is known about the Gi (although it’s not clear how much of a difference it would make in 
applications), for instance the maximal dimension of an irreducible representation.^ 

(6) Finally, we note that a standard heuristic understanding of the strength of the classical 
large sieve inequality (see e.g. HH 7 . 5 ] for a proof) 

X] X] |y^anX(?^) ^ (iV - 1-F Q2) y]] 

y^(modq) n^N 

* 

(where indicates a sum over primitive Dirichlet characters modulo q), is that in its range of 

effectiveness (i.e., ^ N) it is as strong as the Generalized Riemann Hypothesis, as it gives 

for instance 


^ 


n^N 

on average. And indeed this inequality is used as a substitute for GRH in many applications. 
In view of this and the fact that we already know the Riemann Hypothesis over finite fields by 
Deligne’s work, one may think that a large sieve inequality would be either trivial to prove or 
without application (or both) in this context. That it is not the case illustrates two points: first, 
that in the case of sums over a variety of dimension ^ 2, and for <C y^, the inequality (13.11) 
gives on average square-root cancellation in 


E 


a{u) Tr {tt o p(^)(Flu), 

ueU{Fq) 

which is stronger than the Riemann Hypothesis because the latter only provides a saving of 
g-i/2 general, not q~^l‘^. Thus EU) contains information about the finer average distribution 
of the zeros of the L-functions that are involved in those exponential sums. The second point, 
which is valid even for d = 1, is that the large sieve inequality is not only about cancellation, 


^ This is certainly known for the groups Sp{2g, F^) that we will use below, but the author doesn’t know where 
to find it... 
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but about uniformity in estimates. Hence it is not surprising that the “only” difficulty in 
proving (EU), from the Riemann Hypothesis, is a question of uniform bounds for the error 
terms coming after applying Deligne’s results. 


We will prove Theorem 1^.11 in Section For the moment, we derive a large sieve estimate 
concerning the average distribution of the Probenius conjugacy classes in Gi- 

Let L ^ 2 and suppose that for G A, £ ^ L, we select some conjugacy-invariant subset Q{i) 
of Gi with cardinality such that 


m{x) = v^(—1) G F 


for all X and I (where m : G^ and ip are defined by the commutative diagram (I2.2jl : recall 
that d(Fiu) = —1 G Z for u G U(Fq)). 

Let then 

Piu,L)= 1 

p«(Fru)en(£) 


for u G H(Fq) and 

P{L) = J2^ii)\Gj\-\ 

£^L 

The large sieve statement says that for “most” u, the value of P{u,L) is close to the average 
value P{L), this being measured by the variance. 


Proposition 3.3. 


With U and [Tn) satisfying one of the assumptions of Theorem AS. 11 we have 


(3.2) ^ (P(u, L) - P{L)f ^ L^)P{L), 

ueU{Fq) 


where the constants G and A are the same as in Theorem ro In particular, the cardinality of 
the sifted set 

S{U, Q] L) = {u£ U{Fg) I Fr„ n{£) for all i ^ L} 


satisfies 

(3.3) \S{U, 0; L)| ^ {Kq'^ + Gq^-^/^L^)P{L)-\ 


Proof. First (TOl) follows trivially from (TOl) since P{u,L) = 0 for tt G S{U,Q]L), so that the 
left-hand side of the latter inequality is at least equal to P{L)‘^\S{U,Q] L)\. 

So we prove (TOl) : the argument is in large part a jazzed-up version of the one in u Let xi 
be the characteristic function of Since Il{t) is invariant by conjugation, we can expand 

it in Fourier series using the representations of G^. Using the parameterization as ti ® with 
TT G H^ and '0 £ r^/F^, we can write this expansion as 

(3.4) xe{x)=J2 XKV’, 7r)0(x) Tr 7r(x) 

TTsn^ ipete/rj 


with 

(3.5) 


W(0,vr) 


1 


0(x)Tr7r(x) 


-^0((^(l))7(7r), 


where 

7(7r) = Tr7r(x). 


7(l)=a;(^)|G,^|-'. 
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Thus we have 
(3.6) 





Also by orthonormality of the characters of Gi we have 


\Gi\ 


1 


\xi{x)\‘^ = Y1 = Y1 Y1 

XGG^ TT '(p TT ■0 


|rd' 


r|7(7r)P 



hence 

(3.7) 


E 

TTT^l 


I 7^77 1 2 

I 


<E 

TTT^l 


|7(7r)p 




E‘ 


|7(7r)|^ 


KV 


By (jHIU, H.S.5I) and the fact that iI){p^{Ytu)) = we get that for u G U(Fq) and 

i ^ L we have 

XiipeiFiu)) = ^ T^Mv’b)h{'^)biv>i-'i-)) Tr(7r o pf)(Fr„) 

TT 0 ^ 

= Y ° Pi)(F^n) (t^ Y ^(<^(1))V'(V?(-1))) 

IT ' p 

= Y ^Tr(7rop^)(Fr„) 

TT 


hence (since = 1) 


(3.8) P{u, L)=YY ° (Fr«) = PiL) + YY ° 

using (jHSl). 

Denote by R{u,L) the second term on the right-hand side (the sum over i ^ L and vr 7^ 1). 
By Cauchy’s inequality and (EH) we have 


7(77) 


Y Y R{u,L)T^{'^° Pi){FTu) 

£7i TTj^i l^£l ueUiFq) 

* l7('^)P\ 


ueU{Fq) 


< 


(EE' 


177712 

'£7L vr^l F 


(EE'I E i?(rt, L) Tr(7r o p£)(Fr„)| ^ 

l^L TT^l ueUiF^) 

2\ 1/2 


2\ 1/2 


i?(n, L) Tr(7r o p£)(Fru)| ^ 

l^L ,77^1 ueU{Fq) 

We can apply Theorem l3.1l to the last sum, getting (after squaring) 

( \R{u,L)\^)\p{L){Kq^ + Cq^-GH^) Y 

net/CF,) nei7{Fg) 

SO that 

Y \Rb, L)p ^ {nq'^ + Cq'^-P^L^)P{L), 

ueu{Fq) 

which concludes the proof since by (EIHI) we have 

{P{u,L)-P{L)f= Y \Rb^Lf- 

ueU[Fg) ueUiFg) 


□ 
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4 . Estimates for sums of Betti numbers 


In this section we will prove some estimates for Betti numbers of £-adic sheaves needed in 
the proof of the main estimate in Section |SJ 

For a separated scheme of finite type U over Fg of dimension d ^ 1 and any prime £ ^ p we 
denote as usual 

h\U,r) = dimi/*(C/, J^), 
a{U,r) =Y,h\U,r), 

i i 

xc{u,J^) = Y, {-lYKiu^r), x{u,j^) = Y 

i i 

where J- can be either a Q^-sheaf on U or an F^-sheaf. We also write 

a'(C/,^) = K{U,T) ^ 

i<2d 

for the sum of all Betti numbers except the topmost one. 

We first consider the case of curves. Here the situation will be as follows: U /Fg is a smooth 
affine connected curve, p : 7ri(C/, 77 ) —> G is a surjective group homomorphism with G finite, 
and vr is a representation of G, with values in some Q^-vector space of finite dimension, for 
some £ ^ p. We can form the composite tt o p to obtain a lisse Gadic sheaf on U, which is 
denoted 7r(p). Then we wish to find bounds for the sum of Betti numbers a'^(G, vr(p)) which are 
polynomial in the size of G (or the degree dimTr of vr). 

We do this for p of a special type, which we describe in a slightly more general case than will 
be needed in the next section: G is a product 

G= H Gi 

l^i^k 

where Gj is a subgroup of GL(r, FaJ for 1 ^ z ^ /c, Aj a power of a prime £i ^ p (the G 
are not necessarily distinct), and the representation p is a tensor product pi <8) • • • ® pfc where 
the Pi correspond to lisse sheaves £Fil£i£Fi, which are the reductions modulo £i of sheaves Ti of 
ZAj-modules which are part of a compatible system {Ti). Here Za^ is the ring of integers of a 
finite extension Qa, of Q,, with residue field Fa^. 

Our goal is: 

Proposition 4.1. With notation as above, we have the bound 

a'^{U,Ti{p)) ^ G([/, (JFj),/c)(dim7r), 

for some constant G{U, {£Fi),k) depending only on U, k and the compatible system, but not on 
vr. One can take 

(4.1) G = l-Xc{UMi) + kw 

where w ^ 0 is the sum of the Swan conductors of all Ti at the points at infinity for U, as 
described below, which is independent ofi. 

We start by recalling and setting up the description of the ramification structure of sheaves 
on U, as described for instance in |K2I Ch. 1]. Let G be the smooth projective model of U and 
S = G — U the non-empty finite set of “points at infinity”. Let M be a Z[l/p]-module on which 
7ri([/, p) acts through a finite discrete quotient. For each point x £ S, there is a certain direct 
sum decomposition of M seen as representation of the inertia group Ix at x of the type 

M = ^Mx{t) 

10 


where each Mx{t) is I^i-stable. All but finitely many of the Mx(t) vanish, and those t for which 
7^ 0 are called the breaks of M at x. If M is free over some Z[l/p]-algebra A (e.g., 
A = F^, or Q^), the Swan conductor of M at x is then defined by 

Swan 3 ;(M) = t rank Mx{t). 
t^o 

We let Ba;(M) denote the largest break, i.e., the largest t ^ 0 such that Mx{t) ^ 0. Notice then 
the trivial inequalities 

(4.2) Swana,(M) ^ (rankM) B,r(Af), 

(4.3) ^x{M) < Swanx{M). 

In addition, if M = Mi (g) M 2 we have |K2I Lemma 1.3] 

(4.4) B,(M) ^ max(B,(Mi), B,,(M 2 )) ^ B,(Mi) + B,(M 2 ). 

If M is a finite dimensional Q^-vector space, with Q;, a finite extension of with ring of 
integers 7i\, for some i ^ p, and if M C M is an invariant Zi^-lattice with reduction M/AM, 
then we have |K2l Rem. 1.10] 

(4.5) Swana,(M) = Swana,(M) = Swana,(M/£M). 

Finally, the main reason the Swan conductor enters in our computation is the fundamental 
formula of Grothendieck-Ogg-Shafarevitch: 


Proposition 4.2. If Qa is a finite extension of and T is a lisse Qx-sheaf on U of rank r, 
we have 

(4.6) Xc{U,J^) = rXc{U, Qe) - Swan3,(J^), 

x£S 

where Swana;(.F) is Swana;(M) for the Qx-vector space which is the representation space of the 
representation corresponding to T. 


See e.g. |K2I 2.3.1, 2.3.3] for a sketch of the proof. 

Proof of Proposition EH Since U is affine and smooth we have h^{U,7r{p)) = 0 and 

= hl{U,Ti{p)), 

while the Euler-Poincare characteristic is 


Xc{U,7r{p)) = -hl{U,7r{p)) + hl{U,TT{p)). 

We want to bound —Xc{U,T^{p)), and for this start from the Euler-Poincare formula (j4.6|l for 
7r(/9) (which takes value in some GL{r, Qa)): 

Xc{U,tt{p)) = {dim7r)xc{U,Qe) - ^ Swan3;(7r(p)). 


x£S 


By (j4.2|l we get bounds 

Swana,(7r(p)) ^ (dimTr) B2;(7r o p) ^ (dimvr) Ba,(M), 


where M is the Z[l/p]-module 

M = Ml (g) • • • (g) Mfc, 

with Mi ~ F)(,, with the action p = pi ® ® Pk of 7ri([/, p). The last inequality is simply 

because the action of the inertia group “on” nop factors through that on M. 

Now we have by (jOI) and (jlSI) 


Ba;(M) ^ maxB2,(pi) < ^Bj,(pj) ^ y^^Swanxjpi). 

i i 


Hence 


Swan3;(7r(p)) ^ (dimyr) Swanxjpi)- 


Now we can use the fact that each pi is the reduction of the Z;^.-sheaf and use (II311 to get 

Swana;(pi) = Swana;(J^i (g) QaJ, 
i i 

and by (Ol) again we have for all i 

yy Swan2;(jrj (g) QaJ =rXc{U,Q,\i) -Xc{U,J^i). 

xdS 


The crucial point is that XciU,J^i) is independent of i because the sheaves form a compatible 
system (it is minus the degree of the common L-function of the sheaves J^i), and so of course is 
xdU, QaJ- So the sum of the Swan conductors of the (g) Qa^ is independent of i. We denote 
this common value by w, and thus we get 

-Xc{U,tt{p)) ^ (dim7r)|-Xc(t/, Q^) + EE Swan3;(J^i (g) QaJ I 

X i 

= (dim7r)|-Xc(t^, QJ + kw'^ 

We add the requisite h’^{U,'K{p)) which is trivially ^ dimvr by the co-invariant description 




for any lisse Q^-sheaf T, and therefore we get 

(4.7) (T(.(f7,7r(p)) = hl{U,TT{p)) - Xc{U,tt{p)) ^ (dim7r)(l + C) with C = -Xc{U,Qe) + kw. 


□ 


We now come to the result that will be used for the case where Assumption (ii) of Theorem lh.il 
holds. We will use the following result of Katz, building on work of Bombieri and Adolphson 
and Sperber: 

Proposition 4.3. Let q = , U/¥q a smooth affine connected scheme of dimension d ^ 1 

which can be embedded in as a closed subscheme defined by the vanishing of r polynomials 
of degree ^ <5. Then we have 

adU,Qe)^A{N,r,5) 
for some constant A(N,r,5); one can take 

(4.8) A{N, r, 6 ) = 2'’6(3 + rd)^+^. 

This is Theorem 2 of IKH together with its corollaries. 

Proposition 4.4. Let q = p^, Lf/¥q a smooth affine connected scheme over Fg of dimension 
d 1, ip : V ^ U a finite etale connected Galois covering of degree prime to p. There exists a 
constant C{U) such that 

(4.9) adV,Qt)^C{U){degp). 

More precisely, if d = 1 one can take C{U) = (Tc(17, Q^). If d ^ 2, let N, r, 6 be as in 
Proposition \ 4 .d\ f 0 r U. Then one can take C{U) = C{N,r,5), where 

N-l 

(4.10) C{N, r, 6 ) = 2 A{j, r, 6 ) + A{N, r, 6 ) ^ 12iV2^(3 + rd)^+J 

i=i 

Something like this may be already known but we haven’t found it in the literature. The 
proof will proceed by induction on d, following the method used by Katz in HH Th. 2]. For 
the induction step we need the following version of an affine Lefschetz theorem: 
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Proposition 4.5. Let U/Fq be a smooth connected affine scheme over of dimension d^ 2, 
ip : V ^ U a finite etale connected Galois covering with Galois group G. Fixing an immersion 
i : U ^ for some N ^ 1, there exists an affine hyperplane H C A^ such that U H H is 
connected and smooth, W = ip~^{U Cl H) is connected and smooth, and in the diagram 

W -> V 


‘Pi 


p> 


UGH -> U 

the map ip\ is a finite etale Galois covering with group G and the induced maps in etale coho¬ 
mology 

(4.11) 

are isomorphisms for i < d — 1 and injective for i = d — 1. 


Proof. For any hyperplane H, it is of course true that W UCiH is a finite etale covering with 
Galois group G, possibly disconnected. However there exists an open dense set of hyperplanes 
H for which W is indeed connected by |K3I Cor. 3.4.2] with the data {k, E, f, it) = (Fg, V, 0, i). 
Further, the existence of an open dense set of hyperplanes H such that the induced maps 

satisfy the required condition is the special case of |K31 Cor. 3.4.1] for the data {k,E,f,Tr) = 
(Fg, V,0,io ip) (compare the proof of |K31 Cor. 3.4.2]). The existence of a third open dense set 
of H for which U D H is smooth connected is Cor. 3.4.3 of loc. cit. 

Intersecting those three open dense subsets of hyperplanes, one finds one where all the re¬ 
quired conditions hold. □ 

Proof of Proposition First because U is smooth affine and ip etale, hence finite, V is also 
affine and smooth. 

We recall now some deep facts about etale cohomology. First, since V is smooth and connected 
we have (TciV, Q^) = cr{V, Q^) by Poincare duality (see e.g. m VI.3]). 

Next, by the affine cohomological dimension theorem we have 

(4.12) H\U, Q^) = H‘(V, Q^) = 0 for i > d, 
see e.g. |D21 IV.6.4]. 

Finally, because ip is an etale Galois covering of degree prime to p, it is moderately ramified 
and we have 

(4.13) x(V,Qi) = (degip)x(U,Q,) 

which is due to Deligne-Lusztig for Xc (see ^ 2.6, Cor. 2.8]), and we have X = Xc for U and V 
as proved by Laumon [L] . 

Now we are ready to start the proof by induction. Consider first d = 1. We have by 
and Poincare duality which gives h^(U, Q^) = h^(V, Qe) = 1 that 

aiU, Qe) = 2- x{U, Qe), a{V, Qe) = 2 - x(V, Q^). 

By (imi we get 

(^{y,Qe) = 2 - (deg(/?)x(tl, Qf) ^ {deg (p){2 - x{U,Cle)) = deg{ip)a{U,Cli) 

so that we can indeed take C{U) = adU, Q^) in that case. This is the (first) conclusion required 
for d = 1. The alternative bound is also valid since a{U,Qi) ^ A{N,r, 6 ) ^ G{N,r,5) with N, 
r, 6 as described, by Proposition 14.31 and G{N,r,5) defined by (14.101) . 

Now assume that dim!/ = d and (|I3I) holds for dimension d — 1 with the constant (ICTl . 
Fix an embedding i : t/ —> A^ for some N (with the attending r and 6 ). By Proposition 14.51 
there exists an hyperplane H C A^ such that the maps (ICTT) are, in particular, all injective 
for i ^ d — 1. This implies by 

a{V,Qd ^ a{W,Qe) + hdV,Qe) 
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on the one hand, and on the other hand we find 

h\V, Qi) ^ h<^{V, Qe) + h^-\W, Qe) - h^-\V, Q,) = Qe) + Q,), 

so that altogether we have the inequality 

Qi) ^ Qe) + Qi) + a(W, Q,}. 

Now using twice for y —> ?7 and W U il H, which are both Galois with group G of 

order prime to p, we find 

^7(y, Q,) ^ (deg ^){i-lfx{U, Qe) + D H, Q,)) + Q,). 

By induction applied to hh U H H, since U H H is embedded in ~ using the same 

number and degree of polynomials as U, we can estimate the last term by 

a{W, Q^) ^ (deg p)C{N - 1, r, <5) 


and get 

u(y, Q,) ^ (deg ^)[{-lYx{U, Qe) + D H, Q,) + C(iV - 1, r, 5)}. 

Since 


|x(C/,Q^)| ^cT([/,Q,)^A(iV,r,5) 

\X{U n H, Q,)| ^ a{U n H, Qi) ^ A{N - 1, r, 6 ), 
by Proposition ESI we get 

(^{y,Qe) ^ (deg(^)|A(A,r,5) + A{N - l,r,5) +C{N - l,r,(5)| = {degip)C{N,r,5), 

which is the result for U. The last estimate in is a crude consequence of the corresponding 

one for A{j,r,6) given in (14.81) . □ 

Proposition 4.6. Let UjFg be a smooth affine connected seheme of dimension d ^ 1. Let 
p : TTi{U,ri) G be a surjective homomorphism with G finite of order prime to p, let ir : G —>■ 
GL{r,Qi) be a representation of G and tt{p) = tt o p the corresponding lisse sheaf on U. There 
exists a constant G{U) depending only on U such that 

(4.14) c 7 c{U, 7 r{p)) ^ C(C/)|G|(dim7r). 

Proof. (Compare with (4) of Theorem 9.2.6 of Katz and Sarnak |KSlj l Let p : K —> ?7 be the 
connected etale covering with group G corresponding to the kernel of p. It follows that p*{7r{p)) 
is trivial on V, i.e., seeing 7 r{p) as a Q^-lisse sheaf, where Qa is a finite extension of for 
which TT has image in GL{r, Qa), we have 

p*7r{p) ~ Q^. 

Since K ^ ?7 is etale and Galois, the Galois group G acts on the cohomology groups 
HffiV, p*tt{p)) and we have (by the Hochschild-Serre spectral sequence for K —> t/ for instance) 
for all i 

hence 

afiU, tt{p)) ^ afiV, p*'n{p)) = afiV, Q\) = rofiV, Qa) = rafiV, Q^) 

by the formula Qa) = HliV, Q^) (g) Qa- 

Since the group G is assumed to have order prime to p, we have by Proposition 14.41 

afiV,Qi)^C{U)\G\ 

for some constant C{U) independent of tt, and the proposition follows by combining these two 
inequalities. □ 
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Remark 4.7. Contrary to our first optimistic version, the condition that (deg(^,p) = 1 in 
Proposition 14.41 is certainly necessary, as the following example (communicated by Katz) shows: 
take U to be the affine line with coordinate x, and take V = to be the curve — y = x'^, 
for d prime to p. Then we have adVd) = 1 + (p — l)(d — 1). So as d grows, although the degree 
of the covering 14 —> [/ stays p, we see that (JciVd) is unbounded. 

Since the covering is also Galois with group Z/pZ in this case, this also shows that Proposi¬ 
tion EM does not extend to arbitrary groups: the covering K —> [/ corresponds to a surjective 
map p : —> Z/pZ, the representations n = ip are the additive characters of Z/pZ, and 

we have 

CTciVd, Qi) = ^ CTciU, dip)) 

Ip 

(which amounts to the standard counting of points on Vd{Rqd by means of additive character 
sums, or the construction of the sheaves corresponding to those sums) and therefore a bound 
like (14.141) would give adVd) d p‘^C{U), which is also incorrect. 

The condition that the covering be Galois is necessary for the proof of Proposition l4.4l because 
otherwise may fail, even for a covering of degree prime to p, as in the following example 

(again communicated by Katz): take the finite etale covering Gm —> A^ (over Fg) given by 
X xP + Ijx. We have x(A^) = 1, whereas x(Gm) = 0, and the covering is of degree p -|- 1. 

Still one may hope that an analogue of Proposition 14.11 holds for arbitrary U, which would 
give a corresponding general version of Theorem 14.II and its applications. 


5. Proof of the bilinear form estimate 


We come back to the notation of Section |21 before and in the statement of Theorem 14.11 which 
we will now prove. 

The analytic principle for the proof of Theorem 14.11 is quite simple and very well established 
in analytic number theory. We proceed by duality, as first conceived by Vinogradov: for given 
L ^ 1 and A ^ 0, it is equivalent to prove that 


EE*I E a{u) Tr(7r o p£)(Fru) 

TT^l U(iU{¥q) 


«e;7(Fg) 


for arbitrary a(u) G C, or to prove that 


(5.1) 


E lEE* /?(^,7r) Tr(7r o p£)(Fr„) 

U(iU{¥q) i^L 


4 A^^>(4vr)|2. 

TT^l 


for arbitrary /?(£, tt) G C. Recall that vr runs over a set 11^ of irreducible representations of Gi 
up to twist by characters of F^. 

The dual form is more manageable here. Denote by ®(/3) the left-hand side of (15.Ij) . Ex¬ 
panding the square we get 

(5.2) ®(/3) = j;>(4vr)^(^e(47r;£',7r'), 

with 

6(47r;£',7r') = ^ Tr(7r o p4(Fr„)Tr(7r'o p^/)(Fr„). 

The crucial point is the following estimation for the individual S(£, vr; T, tt'). 


Proposition 5.1. With notation as above, and in particular under the assumption that the 
sheaves are linearly disjoint. 
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(i) If the monodromy groups are of prime-to-p order, we have 

|6(£,7r;^,7r) - \ff\q^\ ^ q'^-^/^\G,\{d\rei7:fC(Jj), 

|6(£,vr;7r')| ^ g'^“^/^|G£|(dim7r)(dim7r')C'(?7 ), z/tt/tt' 

|6(£,7r;£',7rO| ^g''-^/2|G,||G£.|(dim7r)(dim7rOC(I7), if I ^ I', 
where G{U) is given by Proposition \T1\ 

(ii) If U is a eurve, and the sheaves arise from a compatible system of Zx-sheaves Ti, we 
have 

|6(£,7r;£,7r) - \tj\q'^\ ^ D{U, (^^)), 

|S(£, vr;/, 7r')| ^ q'^~^^'^{dim7T){diunT')D{U,{iFe)), if £ £' or tt tt', 

where D{U,{!Ff)) is the constant G{U,{£Fi),2) of Proposition EH 

Taking this for granted, we finish quickly the proof of Theorem Khll By H5.2I) we have 
trivially do with 

£' ttVI 

and by Proposition Ib.lL we thus get (ED) with 

(5.3) A = max||f£ + g'^“^/^C'(17)|G^|(dim7r)|^ (dimTr') + ^ |G^,| ^ (dim7r')|| 

TT^ n'^l 

in the case of monodromy of order prime to p, and 

(5.4) A = max||f^ (^,))(dim7r){ (dimTr') + (dimvr')}} 

7t' TT^T^l 

in the case of a curve with a compatible system. We estimate all those terms in terms of the 
parameters s and t of dH using Lemma 1231 (1). In the first case we obtain by appealing also 
to d2ISl) and to dlSI) that 

(5.5) A ^ k/ + 2g'^-^/2^(I7)(cfc2)^/2Li+5*/2+i/2_ 

In the second case we obtain similarly 

(5.6) 2q'^-^/‘^D(U, {;Fi))cxcj'^. 

Thus Theorem EH follows by duality. 

Proof of Proposition 15. ?[ The proof is now an easy application of the Grothendieck-Lefschetz 
trace formula and Deligne’s main theorem of m (compare with P p. 162,163]). The only 
subtlety is that the dependency of the error terms on £, £', vr, tt', must remain controlled, and 
for this we need the results of Section E) 

!£.£ = £' we let G = Gi, G^ = Gf. The representation pi gives a surjective map 7ri(f7, r/) —> G. 
Let T = vr <8) 7^^ This is a (not necessarily irreducible) representation of G, and we will consider 
the sheaf IF = ropi, which is of the type considered in Proposition I4.6l a.nd Proposition 14.1 l lafter 
seeing the representation r as taking value in GL{r, Q^), as we can since it is a representation 
in characteristic 0). 

li £ £', we let G = Gg x Gii, G^ = G^^ x G^,. By the assumption that the family of sheaves 

is linearly disjoint, the product map {p£, p'^) is still a surjective map 

7ri(Z7,?7) GT 

Let T{g,g') = 'K{g) ^irfg') (the “external” product), so r is an irreducible representation of G. 
We will consider the sheaf F = t o (p^, p^), again of the type considered in Proposition 14.61 and 
Proposition 14.1 L 
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In both cases, because G is a finite group, the eigenvalues of the image of r are roots of unity 
so T is punctually pure of weight 0. 

Also in either case, the main point is that the local trace at tt G of T is given by 

construction by 

Tr(Fr„ | = Tr(7r o p^)(Fr„)Tr(7r'o p^/)(Fr„) 

and therefore the fundamental Grothendieck-Lefschetz Trace Formula (see |Grj . |D2j . |M1 VI. 13]) 
states that 


©(£,7r;^',7r0= TV(Fr„ | .F) = ^ (-l)*TV(Fr | .F)). 

ueU(Fg) 

By Deligne’s Weil II Theorem |ml p. 138], the eigenvalues of the geometric Frobenius auto¬ 
morphism Fr acting on are algebraic integers all conjugates of which are of absolute 

value ^ 

It is easy to compute J-) (the action on which contributes potentially terms of maximal 

size using the formula 

= Wc.i-d) 


where V = is the space on which the representation which “is” T acts and W is the space of 
the representation r of The second equality above holds because the disjointness condition 
shows that the map 'Ki{U, rf) —> G^ through which the action factors is always surjective (as 
already observed previously). 

The crucial point is that this coinvariant space is zero unless I = & and vr = vrh Indeed, if 
£ / decomposing tt and vr' restricted to Gf and G^, as sums of irreducible representations, 
the dimension of the coinvariant space (which is the same as that of the invariants under G® 
because we are working with finite groups) is the sum of the dimension of invariants for the 
pairwise tensor products of the components; but those are non-trivial irreducible representations 
of = Gf X G|, so each term of the sum is zero. 

If ^ = i', the last statement in I;emma, [2.1 l exactlv says that the space of invariants is of dimen¬ 
sion |F^| for TT = tt' and 0 otherwise; this is where it is necessary to restrict to representations 
unrelated by twists. 

Thus we derive the bound 


Tr(7r o/3£)(Fr„)Tr(7r'o p£/)(Fr„) 

ueu(Fq) 




(t/,.F) 


if {i,TT) / (^^7I■0 and 


Tr(7r o p£)(Fr„)Tr(7r'o p£/)(Fr„) 

ueU{Fg) 


■ilQ 






otherwise. 

Inserting the bounds for from Pronosition 14.II or Proposition UHl respectively, and 

looking at the various cases, the proposition follows. □ 


Remark 5.2. If the conditions of Theorem rm are not satisfied, we see that we still get an 
inequality 


EE’l E a{u) Tr(7r o p£)(Fr„) 

£ TT U£U{Fq) 


UGUiFq) 


D 


max 

£,7r 


£' TT^T^l 
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for fixed L, with 








The point is that D is independent of q so this is still non-trivial when applied for U x Fgn with 
n ^ + 00 . In a large sieve context as in Proposition ESI it leads to 


lim sup 

n—>-\-oo 


|5(17 X Fqn , n -, L )\ 


^nd 


1 

P{L) 


for fixed L, and using the same sieve as we will in the proof of Theorem IfiTI for all L ^ 2 and 
taking L +oo, this recovers Chavdarov’s irreducibility theorem Th. 2.3] for an arbitrary 
family C/U of genus g curves with geometric monodromy modulo (. equal to Sp{ 2 g^F() for 
almost all t. 


|{m e C/(F„n) I det(l — TFiu \ has small Galois group}] 

lim - - - 

-+00 1C/(F,„)1 


6. Zeta functions of families of curves 


We now come to the application of the large sieve to a strong form of Chavdarov’s Theorem 
on the generic behavior of the numerators of zeta functions of curves in families. If the genus 
is fixed, most of the work is already done in the previous sections or in Chavdarov’s paper, but 
we will look for arguments uniform with respect to g so that, in some cases at least, we obtain 
results valid even for g large (though not for q fixed, g —> +oo). 

First, we recall the definition of the zeta function of a curve over a finite field, in concrete 
terms (so the statements at least can be understood without knowledge of etale cohomology), 
by recalling the diophantine meaning of the polynomials involved. 

Let C/Fg be a smooth projective curve of genus g over a finite field (all curves here and 
below are assumed to be geometrically connected). Its zeta function Z{C) is the formal power 
series given by the diophantine definition 

Z(C)=exp(j;M^r"). 

n>l 


where jC'(Fqn)] is the number of “solutions” to the equations which define C with coordinates 
in the extension field Fgn. A fundamental result due to F.K. Schmidt in this case is that there 
exists a polynomial Pc G Z[T] of degree 2g with T’c(O) = 1 such that 


Z{C) 


Pc{T) 

{l-T){l-qTy 


The cohomological definition is that the polynomial Pc{T) can be described as the (reversed) 
characteristic polynomial of the geometric Frobenius automorphism acting on a suitable etale 
cohomology group, specifically 

(6.1) PciT) = det(l - TFr 1 H\C, Z^)). 


The question investigated by Chavdarov concerns the splitting field of this integer polynomial 
as C varies in an algebraic family, e.g. in a hyperelliptic family 

Cu ■■ = f{x){x - u) 

where / is a fixed polynomial in Fg[A] of degree 2 g with distinct roots in Fg, and u is the 
parameter that can take any value in Fg which is not a zero of / (these conditions ensure that 
the curve Cu suitably “compactified” is a smooth projective curve of the given genus g ^ 1). 

There is an a-priori condition on the splitting field of the polynomial Pc because it satisfies 
the “functional equation” 

r2^Pc(|) =^^0(2”), 

(or equivalently, if a G C is a root of Pc, then qa~^ is also a root). This means that the 
“splitting algebra” Q[T]/(/) has Galois group G which can be seen as a subgroup of the group 
W 2 g of signed permutations of {!,..., 2g}. In other words, W 2 g is the group of permutations 
of g pairs of elements preserving the pairs. In particular, if the polynomial is irreducible, its 
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splitting field has maximal Galois group G ~ W 2 g if and only if the splitting field is of maximal 
degree \ W 2 g\ = 2 ^ 51 !. 

In terms of etale cohomology the functional equation above is a consequence of the Poincare 
duality (in this case, it also amounts to the Weil pairing for the Jacobian variety) which states 
that there is a natural non-degenerate alternating pairing (“cup-product”) 

( 6 . 2 ) 

Note that this implies that the “global” geometric Frobenius Fr of Fg acts on as a 

symplectic similitude for this pairing, with multiplicator q. 

Here is now our first general result about the behavior of the splitting fields in a suitable 
family, which significantly strengthens the results of Chavdarov. 

Theorem 6.1. Fix an integer g ^ 1. Let q = and let U/Fq be a geometrically irreducible 
smooth affine scheme of dimension d ^ 1 such that one of the following two conditions is 
satisfied: 

(i) U is a eurve, i.e, d = 1, 

(ii) we have p > 2g + 1. 

Let IT : C ^ U be a proper smooth family of projective eurves of genus g over U. Assume 
that for all i > Lq the geometric monodromy group of the integral sheaves R^7r\Zi£ is the full 
symplectic group Sp{2g). Then the number N{U/Fq) of u G U(Fg) such that the numerator 

(6.3) = det(l -TFV | H\Cu,Qi)) G Z[r] 

of the zeta funetion of the curve Cu = 'k~^{u) is reducible or has splitting field with degree 
strictly less than 2^g\ satisfies 

N{U/Fq) ^q^-^logq) 

for 7 = 4g2_^3g_|_5 in ease (i) and 7 = in case (ii), where the implied constant depends 

only on Lq, g and U/Fg. 

Here is another almost equivalent way of phrasing this: consider the zeta function 

ZW=exp(2ihfi^,-), 

It follows from Theorem 16.1 1 that it extends to a holomorphic function on the half-plane Re(s) > 
d — 7. 

The second result is a uniform version (in terms of g) for the families of hyperelliptic curves 
already introduced. 

Theorem 6.2. Let g 1, p 2, q = p^ with k 1. Let f £ Fg[X] be a monie polynomial 
of degree 2g with distinct roots in Fg, U C be the complement of the set of zeros of f and 
denote by n : C ^ U the family of hyperelliptic curves of genus g given by 

Cu : y'^ = f{x){x - u) 

completed by the section at 00 , with projection 7r(x, y, u) = u. 

Then the number N{f,q) of u £ U{Fg) sueh that the polynomial 

Pu = det(l - TFr I H\CuMi)) G Z[r] 

is either redueible or has splitting field with degree strictly smaller than 2^g\ satisfies 

N{f,q) <C ^^■'^(logg) 

for 7 = , where the implied constant is absolute. 

Note that the uniform bound in this last result is only non-trivial if g^ is somewhat smaller 
than logy, precisely if = (logwith logy = o(/(y)). Still, it is an uncommon feature 
to be able to say anything for this kind of problems in a situation where g and y increase 
together, instead of having first y —> -|-oo (compare with the discussion in imi Introduction]). 
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Since Theorem lh.2l is more delicate, we will start by proving it in the Section |H1 after some 
common preliminaries; we will then quickly deal with the somewhat simpler case of Theorem l6.ll 
Here we just make a few additional remarks which are of independent interest. 

First of all, since the estimate of Theorem lO is (in particular) uniform in g, it can also be 
used in “horizontal” direction, i.e., with q = p varying. For instance, we deduce the following 
quite easily: 

Proposition 6.3. Let g ^ 1 be an integer, f G Q[^] be a polynomial of degree 2g with distinct 
complex roots. For n G Z not a root of f, let Cn/Q be the hyperelliptic curve of genus g with 
equation 

Cn : y'^ = f{x){x - n) 

and let Jn be its Jacobian. Then for N ^ 3, the number S{N) of integers n with |n| ^ such 
that Jn/Q, is not simple up to isogeny satisfies 

S{N) < N^/^-^{loglogN) 

where 6 = ^ 4 g 2 _^ 3 g _|_5 ■ The implied constant depends on g and the splitting field of f. 

This should be compared with the individual global results of 0 §6]; our result is on average, 
but note that we do not require any information on the image of the Galois representations 
associated to Jn, and in particular we get results valid for all genus, independently of the 
endomorphism ring of Jn or any other global property. 

Proof. Denote first by Q/ the set of primes p totally split in the splitting field of /. Notice that 
for any X ^ 2 we have the sieving estimate 

S{N) ^ |{n G Z I |n| ^ N and n (modp) ^ D(p) for p G Q/, p ^ X}| 

where 

Ll{p) = {t G Z/pZ I f{t) ^ 0 (modp) and det(l — TFr^ | Q^)) is irreducible.} 

By Theorem 16.21 there exists a constant C ^ 0 such that for all p we have 

(6.4) \n{p)\^ p-Cp^~'^{logp) 

with 7 = usual (strong) form of the large sieve (see e.g. 0 Th. 6], |IK1 Th. 

7.14]), we have 

(6.5) S{N) ^ {N + X^)J-^ 
where 

T _ TT I^(P)I 

ii p-Mp^ 

q^X p\q 
peQf 

the b sign indicating a sum restricted to squarefree numbers. 

Just taking primes into account we get by (lOl) 

J > ^ p'^(logp)“^ > X^+'’'(loglogX)“^, 
peQf 

by the Chebotarev density theorem, and taking X = the proposition follows from (|6.5jl . □ 

Another corollary of the large sieve estimates is to families of abelian varieties. 

Corollary 6.4. Let q = p^ and g ^ 1 such that p > 2g 1. Then the number N{g,q) of 
isomorphism classes of principally polarized abelian varieties A/Fq such that the polynomial 
det(l — FrT | 77^(A, Q^)) is either reducible or has splitting field with Galois group strictly 
smaller than W 2 g satisfies 

fV(ff,g)«(?5D+i)/2-7(iogg) 

where 7 = and the implied constant depends only on p and g. 
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We will prove this at the same time as Theorem 16.11 

Finally, here is an (amusing, but not too far-fetched) illustration of what Theorem l(i.ll gives 
which can not be derived from [Hj . 

Proposition 6.5. Let g ^ 1, q = , U/Fq a non-empty open subscheme ofGm/Fq, tt : C —>■ U 

a family of smooth projective curves of genus g such that the geometric monodromy group of 
R^TT\Zii is equal to Sp{2g) for almost all i. Then for all n large enough, there exists a primitive 
root t G UfFqn) C such that det(l — TFr* | Z^)) has maximal Galois group. 


Proof. Indeed, the set of t G GmfFqn) which are primitive roots has cardinality — 1) and 




1 )» 


g" - 1 


> 


log log(g’^ — 1) log n log log q 


for n ^ 1 with an absolute implied constant (see e.g. |hW 1 Th. 328] for this standard estimate), 
and this lower bound is larger than the upper bound given by Theorem 16.11 for those t for which 
the numerator of the zeta function of Ct has small Galois group, if n is large enough. (So in 
fact, most primitive roots t will have the desired property). □ 


Remark 6.6. For any k coprime with q we can find n such that q'^ — 1 = 0 (mod k) and then 

p{q^-l) ^ 
q^ — 1 ^ k 

Choosing suitable values of k, we see that the density of primitive roots in Fg„ is not bounded 
from below by any positive constant. This means that ProDosition l6.5l can not be proved without 
a quantitative form of Chavdarov’s theorem. 


7. Preliminaries for the proof of Chavdarov’s theorem 

We start with some preliminaries related to the group W 2 g and to setting up a sieve for 
characteristic polynomials of symplectic similitudes. 

From the description of W 2 g we see that there is an exact sequence 

1 ^ {±1}5 _^W 2 g^ 6 g^l 

where the second map just look at the permutation of the pairs, and the kernel corresponds to 
just switching the elements of the pairs without moving them. We also denote by i the natural 
inclusion i : W 2 g & 2 g- 

Our first lemma describes various ways of ensuring that a subgroup of W 2 g is equal to W 2 g. 

Lemma 7.1. Let g ^ I and W C W 2 g be a subgroup ofW 2 g. Assume that one of the following 
conditions is true, where i : W 2 g G 2 g is the embedding above: 

(i) For any conjugacy class c C W 2 g, we have cCiW / 0. 

(ii) The subgroup i{W) contains a 2-cycle, a A-cycle, a {2g — 2)-cycle and a 2g-cycle. 

(iii) The subgroup iiyV) contains a transposition and acts transitively on {1,... ,2g}; more¬ 
over, the projection piW) contains a transposition and an m-cycle with m > gl2 prime. 

Then in all cases we have W = W 2 g. 

Proof. Case (i) is a standard result in finite group theory (see e.g. P Lemma 5.8]), which is in 
no way specific to W 2 g. 

Case (ii) is Lemma 2 of |DDS] . 

For case (iii), observe first that the first condition already implies that W = W 2 g if 5 = 1. 
Otherwise we see that p{W) acts transitively on { 1 ,... , 5 } and so with the second and third 
conditions, we get p{W) = &g by the result of Bauer given in m Lemma, p. 98]. Since i{W) 
contains a transposition, we deduce that W = W 2 g by Lemma 5.5 of [Ul- n 

For Theorem EH we can use Case (i) or Case (ii) of the lemma, but Theorem E21 requires 
the finer Case (iii), the point being that the conditions involve “large” subsets of W 2 g. It 
seems to be an intriguing problem in combinatorial group theory to determine how optimal this 
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statement is. In terms of &g, this means the following optimization problem: let fli, ..., 
be conjugacy-invariant subsets of 6 g, and let 

diili) = mm-. 

g'- 

How large can 5(0j) be if (Hj) are chosen so that no proper subgroup of &g can intersect each 
Hj? Bauer’s lemma gives two subsets with ( 5 (r 2 i,H 2 ) l /\/5 (see below). 

To set up our sieve, it will be convenient to say that a polynomial f € A[T] [A any commu¬ 
tative ring) of degree 2g such that /(O) = 1 and 

=/(r), 

is q-symplectic of degree 2g {q will often be fixed and clear from the context, as will g).^ Hence 
the numerator of the zeta function of a curve C/Fg is g-symplectic. 

We now prove a general result comparing a sieve related to characteristic polynomials of 
elements with multiplicator q in the finite group SSp{2g,F£) of symplectic similitudes to the 
“same” sieve applied to all ( 7 -symplectic polynomials of degree 2 g. 

Recall that we denote by m{g) the multiplicator for a symplectic similitude, i.e., 

{gv,gw) = m{g){v,w). 


Lemma 7.2. Let g ^ 1 and i a prime. Put 

'^g,e = {/ G Fi[T] I / is q-sympleetie of degree 2g}. 

Let (l{l) C he an arbitrary subset of eardinality uj{i) and 

n{£) = {g e SSp{2g,Fe) \ m{g) = q, and deg(l -Tg) e H(£)}, 

with cardinality u}{i). 

Then we have 

u{l)\Sp{2g,F,)\-^^Co{l){l + l)-T 


Proof. We have 

^ \{9 ^ SSp{2g,Fe) \ m{g) = q and det{l-Tg) = f}\. 
fen{e) 


The inner quantity, for given /, is exactly what is estimated by Chavdarov in P Th. 3.5], in 
the proof of which it is called A. Using the formula at the bottom of page 159 of loc. cit., we 
get 


uj(£) 


- Y. |Sp( 29 .Ff)| 


^S(f) 


/eo(r) 


\CiAf)(F,)\' 


where Af is a fixed semisimple element in SSp{2g,Fi) with multiplicator q and characteristic 
polynomial / (its existence being proved in [HI Lemma 3.4]), and C{Af) is the centralizer of 
Af in Sp{2g,Fi), 6{f) ^ g being its dimension. Thus 


u;i£)\Sp{2g,Fe)\-^ 




By the formula of Nori at the top of page 160 of loc. cit., which holds essentially because C{Af) 
is known to be a geometrically irreducible variety of dimension 6 {f) ^ g, we have 

u;{£)\Sp{2g,Fe)\-^ ^ £->^ ^ ^ 

fen{e) 

as required. □ 


o 

The terminology “self-reciprocal” is often used when q — 1. 
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The next results are technical estimates which are only required in this precise form for the 
proof of the uniform version of Chavdarov’s theorem. Easier versions (found in 0, EdHI) 
suffice for Theorem EH 

Recall the following terminology: if / is a monic polynomial of degree g in Z[r] which 
factorizes modulo a prime d. as 

with the fi coprime, irreducible, of degree d* ^ 1, then one says that the cycle type (or the 
conjugacy class) associated to / is the conjugacy class in &g of elements which are product of 
disjoint cycles of lengths di, ..., dr- 


Lemma 7.3. (i) Let g ^ I and let c be a conjugaey class in 6g. For i prime, let 

Clc{d) = {f & F£[T] I f is monic of degree g and the cycle type associated to f is c}, 
and = |Qc(^)|- Then we have for £ > Ag'^ 


UJcii) ^ 


e 


(ii) Let g ^ 1 and for t prime let 0 Ji[£) he the number of q-symplectic irredueible polynomials 
in F£[r] of degree 2g. Then for £ > Ag'^ we have 

(hi) Let <7^1 and for £ prime letuj 2 {£) be the number of q-symplectic polynomials of degree 2g 
which factorize as a produet of an irreducible quadratic polynomial and a product of irreducible 
polynomials of odd degrees. Then we have for £ > Ag"^ 

£^ / 1\9 

Proof. We start with (i). If the conjugacy class c is that consisting of permutations with r* 
distinct i-cycles in their decomposition, with 1 ■ ri A- ■ ■ ■ A- g ■ Vg = g, then we have 


n and(he(^)= n 


| 6 „ 








p{i,£) 


where p{i,£) is the number of irreducible monic polynomials of degree i in Ff[r]. Now we claim 
that we have for all £ > Ag^ and 1 ^i ^ g the lower bounds 


(7.1) 


(7.2) 




- 1 , 


p{i, £) ^ ^(^1 - + ^ - 1 for 2 ^ i g. 


From this, which we prove below, we derive for alH, 2 ^ ^ s' and ^ g/i that 

p{i,£){p{i,£)-l)---{p{i,£)-ri + l) {p{i,£) - g/i + 


p{i, £) 
ri 


and for i = 1, r^ ^ g that 




rp. 


^ 1 - 


1\’’i 1 


Fi r.' I 


p{l,£)\ _ p{l,£){p{l,£) - 1) ••• (p(l,^) - ri + 1) ^ {p{l,£) - g + iy^ 


n 


^ 1 


1 \n 


ril 

1 \ T’l 1 

1 - 4 ) 


ri! 


£J V^ril 
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Hence, putting these together, we get 



as desired. 

Now we prove (EU and (ESI). We use the well-known formula of Dedekind 

* d\i 

In particular 

for i > by inspection. Similarly for i = 2 we have 

if i ^ g. For z ^ 3 we use the lower bound 

p{ij) > ^ 

i 

(see e.g. 10 Lemma 3.1]), so that it suffices to show that 



for (. > Ag"^. This is equivalent with 

— iX — g > 0 where X = 

and the quadratic polynomial has largest root equal to 

a = ^(z + \/i^ + A:g/t) < 2ii 

for z ^ 3 and g < \/£/2. Hence for z ^ 3, > 2g we have trivially X = > 2ig ^ 2ii > a, 

so the quadratic polynomial must be > 0 when evaluated at X, which gives 
Coming to (ii) we have (compare [DDSl Lemma 3]) the lower bound 

^i{^) > p{g, (■) - ^p{9,0) - 

This is because we can count irreducible polynomials of degree g in F£[T], minus those for 
which / = T^h{qT is reducible; in this case, / is of the form ch{T)T^h{qT~^) (for some 

normalizing constant c 7 ^ 0 ) where h is irreducible of degree g and not g-symplectic, with both 
h and T^h{qT~^) yielding (with proper normalization factor, so they are distinct up to scalars 
by virtue of h not being g-symplectic) the same reducible /. From the irreducible h, we exclude 
the g-symplectic ones by the trivial bound for their number, hence the inequality above. 
Using dza) for i = g we get 



Finally for (hi) we consider separately the case where g is even or when g is odd. For even g, 
the number L 02 {i) is larger than that of g-symplectic polynomials / of degree 2 g of the form 

/ = fihih2 
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where /i is an irreducible quadratic g'-symplectic polynomial, and hi, /12 are irreducible of odd 
degree g — l with, up to a constant, hi = T^h 2 {qT~^). Counting the possibilities we get by (17.21) 
that 


U}2{i) ^ 


e 


1 - 





ff-i 


£9 



9 


hence a stronger result than claimed in this first case. 

The case where g is odd is similar with polynomials of the form / = /i/ 2 / 3 hi/i 2 with fi 
quadratic irreducible and g-symplectic as before, /2 = 1—aT for some a / 0 and /s = l—qa~^T, 
and hi, h 2 as in the even case but now with odd degree g — 2 . One gets a denominator 
4{g — 2 ) ^ Ag this time. □ 


Remark 7.4. As a by-product of these estimates, applying Gallagher’s method we can derive a 
uniform version of his estimate ini for the number En{N) of monic polynomials f = ^ 

Z[T] of degree n ^ 1 with height max|aj| ^ N and Galois group strictly smaller than ©„, 
namely 

(7.3) En{N) <C n^{2N + N) 

with an absolute implied constant (note {2N + 1)” is the number of polynomials with height 
^ N). 

This is much more impressive than our Theorem lb.2l because the gain in the exponent (namely, 
is independent of the degree n so the bound is non-trivial for n as large as (2A^)^/‘^/(log A^). 
Similarly for reciprocal polynomials, as treated in |DDSj . denoting by £m{N) the number of 
monic polynomials in / £ Z[T] of degree 2m with height ^ N such that = f we get 

£m{N) < m2(2iV + l)"^-V2(iog AT) 

with an absolute implied constant. 

Note that in the two papers quoted, the fundamental large-sieve inequality is not uniform in 
n (resp. m) as stated, but becomes so if one replaces it by the form given in iHni Th. 1], with 
some obvious changes. For instance in m eq. (5)], the term [N^ in the right-hand side 

must be replaced by {^2N + 1 -|- To make this innocuous one may take x = n~^^/2N + 1 
instead of x = ^/N, which leads to 7r(x)“^ <C n{2N + l)“^'^^(log A^), hence the “extra” power 
of n in (dSl) compared with the “density” of irreducible polynomials modulo £ (i.e, about n 
which are used to sieve the reducible ones. 


8. Proof of the uniform version of Chavdarov’s theorem 

We can now start the proof of Theorem 16.21 itself. We will apply Proposition 13.31 with the 
following data: in addition to U, which is a smooth geometrically connected affine curve over 
Fg, we take the sheaves Ti = R^f\F^ for £ G A, where A is the set of odd primes 

These sheaves are of course obtained by reduction modulo £ from the compatible system Eg, = 
R^f\7i£. The existence of the symplectic pairing implies that the arithmetic monodromy 
group of T(, can be seen as a subgroup of SSp{2g, F^), and for any u G U (Fg), the image of Fr^ 
has multiplicator q. 

The most crucial point is that for £ > 2, J-K. Yu (unpublished) has shown that the geometric 
monodromy group for Ti is equal to Sp{2g, F^). Then the sheaves (Ei) are also linearly disjoint 
as a consequence of Goursat’s Lemma (see Gorollarv l2.6[ (1)), and by Lemma l2.3l we have (tO) 
with K = 2. And finally. Lemma [231 (3) gives us (|2.4I) with s = 2g^ + g + l, t = g + l and 

Cl = 1, C2 = 69 . 

Thus all conditions needed to apply Proposition (231 (in the case of a one-parameter family) 
are valid, and it remains to set up the sieving problem. The principle for this is exactly the 
same as the one introduced by Gallagher for polynomials with integer coefficients and bounded 
height jUj. 
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As in Lemma na for any choice of sets Q{£) C Tg^i defined for £ > 2 we let 
n{£) = {g e SSp{2g,Fi) \ m{g) = q, and deg(l -Tg) E 0(£)}. 

Applying Proposition 13.31 Isee (I.S.3|) i to such a sieving problem, we have 

(8.1) |5([/,L!;L)| ^ {2q + igqy\6L)^)PiL)-\ 
where A = 2g‘^ + 3g/2 + 5/2 and 

(8.2) P{L) = ^ co{i)\Gf\-\ 

2<e^L 

(which shouldn’t be confused with the polynomials Pu), and here we have taken the constant 
C = 4:g hy looking at Proposition 14.11 and (15.411 . (|5.bl) since 1 — xiU, Q^) = ^g and it is known 
that all the sheaves are tamely ramified (by |KS11 Lemma 10.1.12]) so that the contribution 
w of the Swan conductors in (j4.1 II vanishes. Moreover, by Ivemma, 17.21 we have 

(8.3) P(L)^ (I;(f)(f + l)-3. 

Now we must show how to use this sieve estimate to study the characteristic polynomials Pu- 
For this we need to recall the following two facts, the first of which is classical, while the second 
is much deeper: 

(i) if / E Z[r] is a polynomial of degree d that factorizes in F£[T] as a product of coprime 
polynomials /i • • • /r, with /j irreducible of degree deg fi = di, then the Galois group of /, seen 
as a permutation group of the complex roots of /, contains a cycle c of type (di,..., dr), i.e., a 
product of disjoint cycles of respective length di, dr (see e.g. |vdWl §61], |Jl p. 302]). 

(ii) the reduction modulo a prime £ of a polynomial Pu (the numerator of the zeta function 
of the curve Cu = '7r“^(u)) is the characteristic polynomial of Fr„ acting on Ti (see |D2[ Fonctions 
L mod. P^], or use the fact that 

det(l -TFtu 1 = det(l - TFr ] H^(Cu,Ze)) = Pu 

by (jnU), and reduce modulo i). 

Thus (ii) allows us to control the reduction of a polynomial Pu, while (i) tells us that the 
reduction gives information on the Galois group of Pu- 

In particular, for any sieving sets Q{£) C SSp{2g,F£), an element u E S{U,id',L) will have 
the property that the Galois group of Pu, seen as a subgroup of & 2 g, does not contain a cycle 
c associated to an / E fl(^), where £ ranges over primes 2 < i ^ L- 

If we have finitely many sieving sets flj, 1 ^ ^ m, defined by the condition that the cycle 

associated to det(l — Tg) is in a certain set Cj of conjugacy classes, and if moreover those c* 
have the property that the only subgroup W C W 2 g containing an element of each Cj is W 2 g, 
then it follows that the set of exceptional u E UiFq) with Pu having small Galois group will be 
a subset of the union of the S{U, i}i',L)- So in such a situation we have 

(8.4) N{U/Fq)!^S{U,nr,L) + --- + S{U,nm]L)^{2q + 4gq^/^L^) ^ R{L)-\ 

Tyemma, [7.1 I describes three possible choices of sets c*; however, the first and the second involve 
some Cj which are “too small”, so the dependency on g in the estimate for Pc{L) is bad (they 
are perfectly suitable for fixed g)- Thus we use Case (hi) of Lemma |23 Precisely, we have 
m = 4 and the four sets can be described as follows: 

(i) rii is the set of irreducible polynomials / E Tg,i- 

(ii) 0,2 is the set of polynomials / E Tg,i which factorize as a product of an irreducible 
quadratic polynomial and a product of irreducible polynomials of odd degrees. 

To define and 124, we recall that any / E Tg,i can be written uniquely 

/ = T^h{qT + T-^) 

where h ^Fi\T] is a monic polynomial of degree g- 
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(iii) ^3 is the set of / G such that the corresponding h has an irreducible factor of 
prime degree > gj 2 . 

(iv) ^4 is the set of / G Tg^£ such that the corresponding h has a single quadratic irreducible 
factor and no other irreducible factor of even degree. 


We claim that those sets do allow us to sieve the exceptional elements u. Indeed, spelling 
out again the relation between the factorization of Pu modulo £ and the existence of elements 
in the Galois group of Pu with the associated cycle type, we see that: 

(i) If Pu is reducible then u G S{U, Q,i] L). 

(ii) If Pu is irreducible but the Galois group W does not contain a transposition, then 

u G S(U,i} 2 ', L), since having P„(modI') G implies that W contains an element with 

cycle type consisting of one 2 -cycle and further cycles of odd length, a power of which will be 
a transposition. 

For the next two facts, notice that the cycle in &g associated to the polynomial Qu such that 
Pu = T3Qu{qT + T~^) is the image by the map p : W 2 g —> &g of the cycle associated to Pu- 

(iii) If Pu is irreducible but p(W) does not contain a cycle of prime order m > g/2, then 

u G s{u,n 3 ;L). 

(iv) If Pu is irreducible but p{W) does not contain a transposition, then u G S{U,Q 4 ; L) 
(as in Gase (ii) previously). 

By Gase (iii) of Lemma l7.ll the u G I^(Fg) that we wish to exclude are therefore in the union 
of the L), and we conclude that 

(8.5) N{U/Fg)^S{U,nr,L) + --- + S{U,nr,L)^4{2q + 4gq^^\6L)^)(mm ^ Pi(L))”\ 


It remains to give appropriate lower bounds of Pi{L). For fls and 114 , since the correspondence 
between polynomials / G Tg^i and the h G F£[T] such that / = T^h{qT + T~^) is one-to-one, 
we can count the corresponding h by Lemma Eni applied to the cycle types (i.e., conjugacy 
classes) in &g associated to the polynomials in flj. For i > Ag^ and i G {3,4}, denoting by Ci 
the set of elements in &g having the associated cycle type, we get 




s. 


and thus for L > 4^^ we have 




IS 


4c/2<£<L 


Vi 


i + l) V 


By the mean-value theorem we have for any £ ^ 2 

= 1 - ghii) + 0{g^h{if) 

with 

2 1 2 
+ 7 !" VW+Ty 

and an absolute implied constant. Inserting this in the sum and using the prime number theorem 
we get for L > 4g‘^ that 


( 8 . 6 ) 


Pi{L) ^ y;^{7r(L) +0{gVL{logL) ^ + g'^ loglog L)}, 


with an absolute implied constant. 

By na p. 99] (where our C 3 is denoted P and is denoted T), we have iov g ^ 1 


(8.7) 


ISJ 


> 


log 2 g 


^ l<^4| 1 

and 7 :^ 


ISn 


V9 
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Using this gives the lower bounds 

( 8 . 8 ) PsiL) » -i_L(logL)-i, and P^iL) » ^L(logL)-i 

log 2g ^ 

with absolute implied constants for L ^ ^^(log 2g) (i.e. for L ^ aig'^(log 2g), where the absolute 
constant ai can be specified from the implied constants in (IHSl) and (|HII)). 

Coming to fli, we have by (ii) of Lemma ESI that for I > Ag'^ 

so by (IH31), the prime number theorem and the mean-value theorem as before we get for L > 4,g‘^ 
that 

-Pi(U) > + 0{g\og\ogL + g^ + y/Z)) 

with an absolute implied constant, and hence for L 3> g'^{\og2g), we have 

(8.9) Pi(L)>-L(logL)-i 

9 

with absolute implied constant. 

Finally by (iii) of Lemma 17.81 we have for i > Ag'^ 

and P 2 {L) > ^(^^(L) -h 0(5 log log L 5 ^)) 

and for L ^ 5 ^ (log 2g) we obtain also 

(8.10) P2(U) >-L(logL)-i 

9 

with absolute implied constant. 

Altogether from (|8.5|1 . (18.81) . (18.911 and (18.101) we get 

N{U/Fg) « g\2q + q^/\6L)^)L-\logL) 

with an absolute implied constant, which can in fact be chosen so that the inequality is valid 
for all L ^ 2 and 5 ^ 1 , since it becomes trivial for g^ ^ L(logL)“^. Choosing 6 L = = 

qi^a +3g+5) ^ with log L <C 5 “^ log q, this gives the announced uniform estimate 

N{U/F,) «gi-^(logg) 

with 7 = ( 45 ^ -|- 85 + 5)“^, and an absolute implied constant. 

9. Proof of the general version of Chavdarov’ Theorem 

We will now quickly prove Theorem EH only highlighting the points where the proof is 
different from that of the previous section. The first step is to check that we can always apply 
Proposition 18.81 to the data consisting of U/Fq and the family of sheaves = R^'k\F^, defined 
for a subset A of primes i > Lq. 

Since our assumption is that for i > Lq the geometric monodromy group of R^tt\Zi is the 
symplectic group Sp{2g) (as algebraic group over Q^), we must show that this implies that the 
monodromy group modulo i is often large. (A priori, for fixed i, the assumption only implies 
that the index of the image of Tri{U,ri) in Sp{2g,Zi/PZi) is bounded for ^ 1, but does not 
say anything for = 1). However, we can appeal to a result of Larsen |La[ Th. 8.17] which 
implies that for a set of primes Aq of (natural) density 1, we do have = Sp{2g,Fi) because 
the sheaves come from a compatible system. (Precisely, in the notation of loc. cit., apply Th. 
3.17 with Q = 7 ri([/,r/), p£ corresponding to i?^ 7 r!Z£, so that by assumption Gi = Sp{2g)/Qi, 
which is connected and simply connected so = Sp{2g), and look at the first few lines of 
the proof of Th. 3.17 to make sure that the statement there involving “hyperspecial maximal 
compact subgroups” does imply that the geometric monodromy group of the reduction of pi is 
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Sp{2g,Y() for ^ in a set of density 1; note also that Larsen’s result is quite deep as it depends 
on the classification of simple finite groups). 

Then, as before, Corollary [221 implies that the sheaves Ti for £ G Aq are linearly disjoint in 
all cases (if £ ^ 5) by the assumption on the geometric monodromy groups, and (|2.3j) holds with 
K = 2 by Lemma 1221 (2). 

Since Ti is obtained by reduction of the compatible system Ti = case (i) of Propo¬ 

sition is applicable if [/ is a curve, with A consisting of all the primes in Aq. 

If U is not a curve but p > 2g + 1, we use the following simple lemma (compare |K51 Lemma 
7.5.1]) : 

Lemma 9.1. Let r ^ 1. For any p > r + 1, there exists a G {^jpTL)^ such that the order of 
GL{r, F^) is prime to p for any prime £ = a (modp). 

Proof. The order of GL(r, F^) is 

|GL(r,F^)| Yl (f-1) 

so the condition will hold whenever the order of a modulo pis>r. Ifp>r + 1, a primitive 
root modulo p will certainly work. □ 

For p > 2g + 1, we can apply the second case of Theorem 13.11 and Proposition 13.31 with A 
consisting of primes in Aq which are congruent modulo p to the a given by this lemma for 
r = 2g. This set has positive density among the primes because Aq has density 1. 

We can now define sieving sets analogous to the previous flj, 1 ^ f ^ 4, for C/F^ and we 
have (|8.5I) . Since we consider g to be fixed here, we can rewrite (1H21) as 

Pi(L)» ^ Uiiiy-s 

where the implied constant depends on g. Then we obtain 

P,{L) > 7r(L) 

for L > Lq, the implied constant depending on g and p in Case (ii) (through the density of A), 
either from Lemma o or directly from Dedekind’s formula used in its proof. 

Hence we obtain by Proposition 13.31 

N{U/¥q) « (g'^ + Cq‘^-^/^L^)L-\\ogL), 

for L > Lq, where the implied constant depends on g, and on p in Case (ii). If we take 
L = g(V+39+5)-i^ ^ ^ ^(10g2+69+8)-i^ 

we have hence 

N{U/¥q) < q'^L-\\ogL) < q^-^{\ogq) 

T' = 4gW3g+5 ^ = lOgWeg+s )^ desired. 

The proof of Corollary 16.41 is similar, applying first the large sieve to a suitably rigidified 
moduli space AgjYq of principally polarized abelian varieties over Fg, for instance the moduli 
space Ag^^c of |KSIl 11.3]. Strictly speaking we need to restrict to a smooth connected affine 
subscheme U C Ag^^c, but this is not a problem as observed in the remarks after Theorem 13.1 1 
(see Remark 4). Over U we have a universal family vr : Ag^sc U and we take the sheaves 
= PfixxfLi and their reductions TnliTi. The monodromy groups are as large as possible 
because already this is the case for the families of (canonically principally polarized) jacobians 
of the hyperelliptic curves of genus g of Theorem 16.21 After applying Proposition 13.31 to the 
same sieving problem as in Theorem 16.1 L we go (with the same saving) from the number of 
“exceptional” principally polarized abelian varieties with a 3£-structure to the number N{g, q) 
by dividing out by the free rigidifying parameters and considering the situations with extra 
automorphisms, as done in [KS11 11.3] for the case of curves. 
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